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Title 

A method for obtaining structural information concerning an encoded 
molecule 

5 Technical Field of the invention 

The present invention relates to the method for obtaining structural 
information about an encoded molecule. The encoded molecule Is usually 
produced by a process that comprises the reaction of a plurality of chemical 
entities. The synthesis of the encoded molecule is recorded or programmed 
10 in an Identifier oligonucleotide which is attached to the encoded molecule. 
The structural information obtained by the present method may be used to 
obtain the entire structure of the encoded molecule or a part thereof. 

Background of the invention 

1 5 The generation of molecules carrying new properties remains a challenging 
task. Recently, and number of procedures have been suggested that should 
allow a more efficient generation and screening of a huge number of mole- 
cules. The approach taken may involve the encoding and/or templatlng of 
molecules other than natural blopoiymers and a coupling of the molecules to 

20 respective templates or identifier parts containing information about the reao 
tants that have participated in the formation of the molecule. These ap- 
proaches allow the researcher to generate and screen a huge number of 
molecules at the same time. 

25 In US 5,723,598 it is suggested to prepare libraries of Afunctional molecules, 
in which one part of the Afunctional molecule comprises an encoded part and 
the other part of the molecule contains an identifying part The identifying part 
is segregated into codons, i.e. a stretches of nucleotides, which codes for 
reactants that have been involved In the synthesis of the encoded molecule. 

30 The libraries of Afunctional molecules are generally prepared by a spllt-and- 
rnix method, which involves the inrtial reaction between a nascent Afunctional 
molecule and a range of different reactants in separate compartments at one 
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end of the nascent bifunctional molecule and a corresponding range of Identi- 
fier unit oligonucleotides (codons) and the other end. Subsequently, the con- 
tents of the compartments are mixed and the mixture is disposed in separate 
compartments and reacted aggin with another range of reactants jand corre- 
5 spending codons. Following the generation of a library of the bifunctional 

molecules, a partitioning with respect to affinity towards a target is conducted 
and the identifier part of the bifunctional molecule is decoded to establish the 
chemical structure of the compounds in the library that is likely to p& a Hgand 
to the target. The decoding step implies that the identifier oligonucleotides 
10 initially are amplified by PGR. The PGR product is subsequently incorporated 
in to a suitable vector which is transformed to a host organism, usually E. 
coll. Following the incubation of the E. coli, colonies are picked and se- 
v quenced. 

15 Halpin and Harbury have in WO 00/23458 suggested an improvement to tine 
approach stipulated above. The approach is based on the same split-and-mbc 
strategy for synthesis of combinatorial libraries comprising two or more syn- 
thetic steps. A plurality nucleic add templates are used, each having at one 
end a chemical reactive site and dispersed throughout the stand a plurality of 

20 codon regions, each of said codon regions in turn specifying different codons. 
Separately, each of the strands, identified by a first codon region vits reacted 
at the chemical reaction sites with specific selected reagents. Subsequently, 
ail the strands are pooled and subjected to a second partitioning tfased on a 
second codon region. The split-and-combine method is conducted an appro- 

25 priate number of times to produce a library of typically between 1 0 3 and 1 0 6 
different compounds. The decoding is performed utilizing the process de- 
picted above. 

Recently, new a method for encoding molecules has been suggested, which 
*' 4 30 can be performed in a single "pof . WO 02/0041 0 and WO 02/103008 dis- 
close methods for preparing virtually any molecule connected to a template 
coding for chemical entities which have reacted to form the molecule. In 
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short, a template segregated into a plurality of codons and a plurality of build- 
ing blocks comprising a transferable chemical entity and an anticodon are 
Initially provided. Under hybridisation conditions, the template and building 
blocks are annealed together and the chemical entities are subsequently re- 
5 acted to form the molecule. However, after a sufficient number of rounds of 
selections have been performed, the tomplato must bo decoded to establish 
the identity of the encoded molecule. The decoding step implies that the 
template oligonucleotides Initially are amplified by PGR. The PGR product is 
subsequently incorporated In to a suitable vector which is transformed to a 
10 host organism, usually E. coll. Following the incubation of the E. fcoll, colonies 
are picked and sequenced. 

In an aspect of the invention, it Is the object to facilitate the decoding of the 
coding oligonucleotide in order to obtain at least partial structural information 
15 of the encoded molecule being a ligand towards a target In another aspect of 
the invention, it is desired to obtain information about which chemical entities 
that result in encoded molecules successful In a selection process. Such 
chemical entities may be used In the formation of a second generation li- 
brary. 

20 

Summary of the Invention 

The present Invention concerns a method for obtaining structural Information 
about an encoded molecule produced by a process comprising reaction of a 
plurality of chemical entities, said encoded molecule being capable of forming 

25 part of a complex also comprising an identifier oligonucleotide containing 
codons Informative of the identity of chemical entities which have participated 
in the formation of the encoded molecule, the method comprises mixing a 
primer oligonucleotide with the identifier oligonucleotide, subjecting the mix- 
ture to a condition allowing for an extension reaction to occur when the 

30 primer is sufficient complementary to a part of the identifier oligonucleotide, 
and evaluating, based on measurement of the extension reaction, the pres- 
ence, absence, or relative abundance of one or more codons. 
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The method according to the Invention may be performed on a single Identi- 
fier oligonucleotide or a composition of identifier oligonucleotides to obtain 
structural information about the encoded molecule or a composition of en- 
5 coded molecules, respectively, that have been attached to the identifier oli- 
gonucleotlde(s). 

A single identifier may be analysed using the<above method to verify the 
Incorporation into the encoded molecule of one or more chemical entitles or 

10 to deconvolute the identity of the entire encoded molecule. A composition of 
two or more identifier oligonucleotides generally results from a selection 
process, i.e. a process involving subjecting a library of different complexes to 
a condition partitioning the composition from the remainder of tha.libraiy. 
Usually the partitioning condition includes an affinity assay in which the 

1 5 library of complexes is contacted with a target and the identifier . 
oligonucleotides of the binding complexes are harvested. 

The conditions allowing for an extension reaction to occur may be selected 
from a enzymatic or chemical means. Suitably, the condition involves one or 

20 more enzymes. In a certain embodiment of the Invention, the condition which 
allows for an extension reaction to occur includes a polymerase or a ligase 
as well as suitable substrates for the enzyme used. Preferred Is a poly- 
merase together with a blend of (deoxy)ribonuc!eotide triphosphates. Suita- 
bly, the blend include one or more of dATP, dGTP, dCTP, and dTTP. 

25 ■■(. 

A library of complexes can have any appropriate size. Typically, the size is 
above 10 3 , typically above 10 6 different complexes. An effective. Extensive, 
and rapid decoding is thereforeidesirable. The method of the present inven- 
tion may be used at various stages of the process of finding a ligand to a cer- 

30 tain target As examples, the method of the Invention may be used for con- 
trolling the quality of a starting Irbrary. The information acquired may be used 
to verify which codons being present, absent, and. in some embodiments, 
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also the relative abundance. Thus, the method of the invention delivers a re- 
liable picture of the process which has produced the library. If, for.some rea- 
son, a chemical entity has not been incorporated into the encoded molecules, 
the absence of a codon for this chemical entity will In certain embodiments of 
5 the Invention indicate this fact . 

Another example of the use of the present method is following the selection. 
After the selection has been performed the cOdon profile is indicative of the 
chemical entities that have been used in the synthesis of encoded molecules 

10 having an affinity towards the target In the event the selection has been suf- 
ficient effective it may be possible directly to deduce a part or the entire struc- 
ture of binding encoded molecules. Alternatively. It may be possible to de- 
duce a structural unit appearing more frequently among the encoded mole- 
cules after the selection, which gives important information to the structure- 

15 activity-relationship (SAR). If the selection process has not narrowed the size 
of the library to a manageable number, , the formation of a second generation 
library may be contemplated. In the formation of the second generation li- 
brary chemical entities which have not been involved In the synthesis of en- 
coded molecules that have been successful in the selection may be omitted, 

20 thus limiting the size of the new library and at the same time increasing the 
concentration of binding complexes. The second generation library may then 
be subjected to more stringent selection conditions to allow only the encoded 
molecules with a higher affinity to bind to the target The second generation 
library may also be spiked with certain chemical entities suspected of In- 

25 creasing the performance of the final encoded molecule. The indication of 
certain successful chemical entities may be obtained from the SAR. The use 
in a seoond generation library of chemical entities, which have proved to be 
interesting for further investigation in a preceding library, may thus entail a 
shuffling with new chemical entities that may focus the second generation 

30 library in a certain desired direction. ; 1 
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The relative abundance of codons may make it possible to decode a plurality 
of identifiers simultaneously, even in th'e case when two or more^entlflers 
contain the same codon. Thusjfollowing the formation and selection of a first, 
second or further generation library, the Identity of binding encoded mole- 
5 cufes may be partly or entirely deconvolved by the present method. 

f « 

In a practical approach the library comprises complexes with identifier oli- 
gonucleotides having n codon positions eachVIn a certain-aspect of the in- 
vention n is an integer independently selected from of 2 to 8. It may be pre- 

1 0 famed that n is constant among all the complexes In the library to facilitate the 
decoding process. Each of the radons in a certain position is in an aspect of 
the invention selected from a set of m different codons. m may vary for each 
* codon position or may be constant among the various codon positions. It may 
be preferred in some embodiments to have all the codons in each position 

15 selected from the same set of m codons. However, in other embodiments, 
especially such involving hybridisation in the recognition between ithe codon 
and the anticodon, it may be preferred that all the codons are different 

t M 
Any member of the codon set differs from any other codons in the* set with 

20 the identity of at least one nucleotide, i.e. at least one nucleotide position oc- 
curs. In some aspects of the invention it is preferred that any member of the 
codon set differs with at least two nucleotides nucleotide positions from any 
other member of the set to increase the fidelity of the method. In general, it is 
desired to maximize the differences between individual codons of the set In 

25 some embodiments of the invention, a set of primers comprising a sequence 
of complementing the set of codons are prepared. 

In a preferred aspect of the method a framing sequence is related to each of 
the n codon positions in a particular complex, said framing sequence posl- 
30 tions the reaction of a chemical entity in the synthesis history of the encoded 
molecule. Typically, the framing sequence is identical among the'complexes. 
for each of the reaction rounds and is selected from a group of n 'different 

! -I 
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nucleotide sequences. In a certain aspect of the Invention n x m different 
primers fully or in part complementing any combination of the set of m differ- 
ent codons and the set of n different framing sequences Is prepared. The n x 
m primers may be used in separate compartments to reveal the identity of a 

- s ' it 

5 chemical entity as well as the point In time of the synthesis of the encoded 
molecule Is has reacted. 

. ! i 

In a particular aspect, the invention concerns v a method for identifying the 
chemical entities utilized in the formation of an encoded molecule or a com- 

10 position of encoded molecules, wherein in separate compartments, n x m 
primers individually are mixed with an aliquot of a composition obtained by 
subjecting a library of different complexes to a condition partitioning said 
composition from the remainder of the library, subjected to a mixture of poly- 
merase and substrate (deoxy)ribonucleotfde triphosphates under conditions 

1 5 allowing tor an extension reaction to occur when a primer Is sufficient com- 
plementary to a part of one or more identifier oligonucleotides present in the 
aliquot, and evaluation, based on measurement of the extension reaction, the 
presence, absence, or relative abundance of one or more codons in each 
compartment 

20 ! • i- 

•» 

The invention also concerns a set comprising a collection of oligonucleotide 
primers, a polymerase, a composition of (deoxy)ribonucleotide triphosphates 
(dNTPs). and a library of complexes composed of a display molecule part 
and an identifier oligonucleotide, said oligonucleotide comprising codons in- 
25 formative of the identity of the chemical entities which has participated in the 
formation of the display molecule, wherein the oligonucleotide primers are 
sufficiently complementary to codons appearing on the identifier oligo nucleo- 
tides in the library to allow for an extension to occur. 
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Detailed Description of the Invention 

Complex 

The complex comprises an encoded molecule and an Identifier oligonucleo- 
tide. The identifier comprises codons that identify the encoded molecule. 
5 Preferably, the identifier oligonucleotide identifies the encoded molecule 
uniquely, i.e. in a library of complexes a particular identifier is capable of dis- 
tinguishing the molecule it is attached to from the rest of the molecules. 

• ! ' ! 

The encoded molecule and the identifier may be attached directly to each 
10 other or through a bridging moiety. In one aspect of the invention, the bridg- 
ing moiety is a selectively deavable linkage. 

The identifier oligonucleotide may comprise two or more codons. In a pre- 
ferred aspect the identifier oligonucleotide comprises three or more codons. 
1 5 The sequence of each codon can be decoded utilizing the present method to 
identify reactants used in the formation of the encoded molecule. When the 
Identifier comprises more than one codon, each member of a pool of chemi- 
cal entities can be identified and the order of codons is informative of the syn- 
thesis step each member has been incorporated in. 

20 

In a certain embodiment, the same codon is used to code for several different 
chemical entities. In a subsequent identification step, the structure of the en- 
coded molecule can be deduced taking advantage of the knowledge of dif- 
ferent attachment chemistries, steric hindrance, deprotection of orthogonal 
25 protection groups, etc. In another embodiment the same codon is used for a 
group of chemical entities having a common property, such as a lipophilic 
nature, a certain attachment chemistry etc. In a preferred embodiment, how- 
ever, the codon is unique i.e. a similar combination of nucleotides does not 
appear on the Identifier oligonucleotide coding for another chemical entity. In 
. 30 a practical approach, for a specific chemical entity, only a single combination 
of nucleotides is used. In some aspects of the invention, it. may be advanta- 
geous to use several codons for the same chemical entity, much in the same 
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way as Nature uses up to six different codons for a single amino acid. The 
two or more codons identifying the same chemical entity may carry further 
information related to different reaction conditions. 

5 The sequence of the nucleotides in each codon may have any suitable 

length. The codon may be a single nucleotide or a plurality of nucleotides. In 
some aspects of the invention, it is preferred that each codon Independently 
comprises four or more nucleotides! more preferred 4 to 30 nucleotides. In 
some aspects of the Invention the lengths of the codons vary. . ' 

10 

A certain codon may be distinguished from any other codon In the library by 
only a single nucleotide. However, to facilitate a subsequent decoding proc- 
ess and to increase the ability of the primer to discriminate between codons it 
Is In general desired to have two or more mismatches between a particular 

15 codon and any other codon appearing on identifier oligonucleotide. As an 
example, if a codon length of 5 nucleotides is selected, more than 100 nu- 
cleotide combinations exist in which two or more mismatches appear. For a 
certain number of nucleotides in the codon, it is generally desired to optimize 
the number of mismatches between a particular codon relative to any other 

20 codon appearing in the library. 

The identifier oligonucleotide will in general have at least two codons ar- 
ranged in sequence, i.e. next to each other. Two neighbouring codons may 
be separated by a framing sequence. Depending on the encoded, molecule 
25 formed, the identifier may comprise further codons, such as 3, 4, 5, or more 
codons. Each of the further codons may be separated by a suitable framing 
sequence. Preferably, all or at least a majority of the codons of the identifier 
are separated from a neighbouring codon by a framing sequence! The fram- 
tng sequence may have any suitable number of nucleotides, e.g. 1 to 20. Al- 
. 30 tentatively, codons on the identifier may be designed with overlapping se- 
quences. 
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The framing sequence, if present, may serve various purposes. In one setup 
of the invention, the framing sequence identifies the position of the codon. 
Usually, the framing sequence either upstream or downstream of a codon 
comprises information which positions the chemical entity and the reaction 
5 conditions in the synthesis history of the encodedmolecule. The framing se- 
quence may also or in addition provide for a region of high affinity. The high 
affinity region may ensure that a hybridisation event with an anti-codon will 
occur in frame. Moreover, the framing sequence may adjust the annealing 
temperature to a desired level. 

10 

A framing sequence with high affinity can be provided by incorporation of one 
or more nucleobases forming three hydrogen bonds to a cognate nucleo- 
base. Examples of nucleobases having this property are guanine; and cyto- 
sine. Alternatively, or in addition, the framing sequence may be subjected to 
15 backbone modification. Several back bone modifications provides for higher 
affinity, such as 2*-0-methyl substitution of the ribose moiety, peptide nucleic 
acids (PNA) f and 2*-4 % O-methylene cydisation of the ribose moiety, also re- 
ferred to as LNA (Locked Nucleic Acid), 

20 The sequence comprising a codon and an adjacent framing sequence has In 
a certain aspect of the invention a total length of 1 1 nucleotides or more, 
preferably 15 nucleotides or more. A primer may be designed to complemen- 
tary to the codon sequence as well as the framing sequence. The presence 
of an extension reaction under conditions allowing for such reaction to occur 

25 is indicative of the presence of the chemical entity encoded in the codon as 
well as the position said chemical entity has in the entire synthesfs history of 
the encoded molecule. ? 

The identifier may comprise flanking regions around the coding section. The 
30 flanking regions can also serve as priming sites for amplification reactions, 
such as PCR or as binding regipn for oligonucleotide probe. The!fdentifier 
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may in certain embodiments comprise an affinity region having the property 
of being able to hybridise to a building block. j 

tt is to be understood that when the term Identifier oligonucleotide is used in 
5 the present description and claims, the identifier oligonucleotide may be in 
the sense or the anti-sense format, i.e. the identifier can be a sequence of 
codons which actually codes for the encoded molecule or can be a sequence 
complementary thereto. Moreover, the identifier may be single-stranded or 
double-stranded, as appropriate. 

10 

The encoded molecule part of the complex is generally of a structure ex- 
pected of having an effect on a target When the target is of pharmaceutical 
Importance, the encoded molecule is generally a possible drug candidate. 
The complex may be formed by tagging a library of different possible drug 

15 candidates with a tag, e.g. a nucleic acid tag identifying each possible drug 
candidate. In another embodiment of the invention, the molecutetformed by a 
variety of reactants which have reacted with each other and/or a scaffold 
molecule. Optionally, this reaction product may be post-modified to obtain the 
final molecule displayed on the complex. The post-modification may involve 

20 the cleavage of one or more chemical bonds attaching the encoded molecule 
to the Identifier in order more efficiently to display the encoded molecule. 

The formation of an encoded molecule generally starts by a scaffold, i.e. a 
chemical unit having one or more reactive groups capable of forming a 

25 connection to another reactive group positioned on a chemical entity, thereby 
generating an addition to the original scaffold. A second chemical entity may 
react with a reactive group also appearing on the original scaffold or a 
reactive group incorporated by»the first chemical entity. Fgrther chemical 
entities may be involved in the formation of the final reaction product The 

30 formation of a connection between the chemical entity and the nascent 

encoded molecule may be mediated by a bridging molecule. As an example, 
if the nascent encoded molecule and the chemical entity both comprise an 



12 

amine group a connection between these can be mediated by a dfcarboxylic 
acid, A synthetic molecule is in general produced In vitro and rna^ be a 
naturally occurring or an artificial substance. Usually, a synthetic rholecule is 
not produced using the natural^ translation system in an in vitro ptocess. 
5 I '•! 

The chemical entitles that are phe cursors for structural additions or 
eliminations of the encoded molecule may be attached to a building block 
prior to the participation in the formation of th& reaction product leading the 
final encoded molecule. Besides the chemical entity, the building block 
10 generally comprises an anti-codon. In some embodiments the building blocks 
also comprise an affinity region providing for affinity towards the nascent 
complex 

Thus, the chemical entitles are suitably mediated to the nascent encoded 
15 molecule by a building block, which further comprises an anticodon. The anti- 
codon serves the function of transferring the genetic information of the 
building block In conjunction with the transfer of a chemical entity iThe 
transfer of genetic information and chemical entity may occur in any order. 
The chemical entities are preferably reacted without enzymatic interaction in 
20 some aspects of the invention. Notably, the reaction of the chemical entities 
is preferably not mediated by ribosomes or enzymes having simildr activity. 
In other aspects of the invention, enzymes are used to mediate the reaction 
between a chemical entity and a nascent encoded molecule. 

25 According to certain aspects of the invention the genetic information of the 
anti-codon is transferred by specific hybridisation to a codon on a nucleic . 
acid template. Another method for transferring the genetic Information of the 
anti-codon to the nascent complex is to anneal an oligonucleotide 
complementary to the anti-codon and attach this oligonucleotide to the 

30 complex, e.g. by ligation. A still further method involves transferring the 

genetic information of the anti-cddon to the nascent complex by an extension 

reaction using a polymerase and a mixture of dNTPs. ! 

; t:l 
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The chemical entity of the building block may in most cases be regarded as a 
precursor for the structural entity eventually incorporated into the ^ncoded 
molecule. In other cases the chemical entity provides for the elimlpations of 

5 chemical units of the nascent encoded molecule. Therefore, whep It in the 
present application wilh claims js slated thaL a chemical entity is transferred 
to a nascent encoded molecule, it is to be understood that not necessarily all 
the atoms of the original chemigal entity is to^Be found in the eventually 
formed encoded molecule. Also, as a consequence of the reactions involved 

10 in the connection, the structure of the chemical entity can be changed when it 
appears on the nascent encoded molecule. Especially, the cleavage resulting 
in the release of the entity may generate a reactive group which in a subse- 
quent step can participate in the formation of a connection between a nas- 
cent complex and a chemical entity. 

15 

The chemical entity of the building block comprises at least one reactive 
group capable of participating in a reaction which results in a connection be- 
tween the chemical entity of the building block and another chemipal entity or 
a scaffold associated with the nascent complex The number of reactive 

20 groups which appear on the chemical entity is suitably one to teni]A building 
block featuring only one reactive group is used t.a. in the end posijtlons of 
polymers or scaffolds, whereas building blocks having two reactive groups 
are suitable for the formation of;the body part of a polymer or scaffolds capa- 
ble of being reacted further. One, two or more reactive groups intended for 

25 the formation of connections, are typically present on scaffolds. Non-limiting 
examples of scaffolds are opiates, steroids, benzodiazepines, hydantoines, 
and peptidylphosphonates. 

The reactive group of the chemical entity may be capable of forming a direct 
30 connection to a reactive group of the nascent complex or the reactive group 
of the building block may be capable of forming a connection to a reactive 
group of the nascent complex through a bridging fill-in group. It is to be urn 

i \ 
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derstood that not all the atoms of a reactive group are necessarily maintained 
In the connection formed. Rather, the reactive groups are to be regarded as 
precursors for the structure of the connection, 

5 The subsequent cleavage step to release the chemical entity from the build- 
ing block can be performed in any appropriate way. In an aspect of the inven- 
tion the cleavage involves usage of a chemical reagent or an enzyme. The 
cleavage results in a transfer of the chemical "entity to the nascent encoded 
molecule or in a transfer of the nascent encoded molecule to the'dhemical 

1 0 entity of the building block. In some cases it may be advantageous to intro- 
duce new chemical groups as a consequence of linker cleavage. The new 
chemical groups may be used for further reaction in a subsequent cycle, ei- 
ther directly or after having been activated. In other cases it is desirable that 
no trace of the linker remains after the cleavage. 

15 

In another aspect, the connection and the cleavage is conducted as a simul- 
taneous reaction, i.e. either the chemical entity of the building block or the 
nascent encoded molecule is a leaving group of the reaction. In some as- 
pects of the invention, it Is appropriate to design the system such that the 

20 connection and the cleavage occur simultaneously because this will reduce 
the number of steps and the complexity. The simultaneous connection and 
cleavage can also be designed such that either no trace of the linker remains 
or such that a new chemical group for further reaction Is introduced, as de- 
scribed above. I '! 

25 j 

The attachment of the chemical: entity to the building block, optionally via a 
suitable spacer can be at any entity available for attachment, e.g. the chemi- 
cal entity can be attached to a nucieobase or the backbone. In general, it is 
preferred to attach the chemical entity at the phosphor of the intemucleoside 

30 linkage or at the nucieobase. When the nucieobase is used for attachment of 
the chemical entity, the attachment point is usually at the 7 position of the 
purines or 7-deaza-purins or at the 5 position of pyrimidines. The nucleotide 



.. 2UU3 TOR 09:30 VAX -J 

• *\ 
! 

15 

may be distanced from the reactive group of the chemical entity by a spacer 
moiety. The spacer may be designed such that the conformational spaced 
sampled by the reactive group is optimized for a reaction with the reactive 
group of the nascent encoded molecule. 

5 

The encoded molecules may have any chemical structure. In a preferred 
aspect, the encoded molecule ran be any compound that may be 
synthesized in a component-by-component fashion. In some aspects the 
synthetic molecule is a linear or branched polymer. In another aspect the 
10 synthetic molecule is a scaffolded molecule. The term "encoded molecule" 
also comprises naturally occurring molecules like a-polypeptides etc, 
however produced In vitro usually in the absence of enzymes, like ribosomes. 
In certain aspects, the synthetic molecule of the library Is a non-a- 
polypeptide. 

15 

The encoded molecule may have any molecular weight. However, in order to 
be orally available, it is in this case preferred that the synthetic molecule has 
a molecular weight less than 2000 Daltons, preferably less than 1000 Dalton, 
and more preferred less than 500 Daltons. 

20 

The size of the library may vary considerably pending on the expected result 
of the Inventive method. In some aspects, it may be sufficient that the library 
comprises two, three, or four different complexes. However, in most events, 
more than two different complexes are desired to obtain a higher diversity. In 
25 some aspects, the library comprises 1 ,000 or more different complexes, more 
preferred 1 ,000,000 or more different complexes. The upper limit for the size 
of the library Is only restricted by the size of the vessel in which the library is 
comprised. It may be calculated that a vial may comprise up to 10f 4 different 
complexes. 

.30 
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Extension r^cflon : ' 
The extension reaction requires a primer, a polymerase as well a? a collec- 
tion of deoxyribonucleotide triphosphates (abbreviated dNTP's herein) to 
proceed. An extension product may be obtained in the event the primer is 
5 sufficient complementary to an identifier oligonucleotide for a polymerase to 
recognise the double helix as a substrate. After binding of the polymerase to 
the double helix, the deoxyribonucleotide triphosphates (blend of dATP, 
dCTP, dGTP. and dTTP) are incorporated into the extension product using the 
identifier oligonucleotide as template. The conditions allowing for the exten- 

10 sion reaction to occur usually includes a suitable buffer. The buffer may be 
any aqueous or organic solvent or mixture of solvents in which the poly- 
merase has a sufficient activity. To facilitate the extension process the poly- 
merase and the mixture of dNTP's are generally included in a buffer which is 
added to the identifier oligonucleotide and primer mixture. An exemplary kit 

1 5 comprising the polymerase and the nNTP*s for performing the extension 
process comprises the following: 50 mM KCI; 10 mM Trls-HCl atpH 8.3; 1.5 
mM MgCl2 ; 0.001% (wi/vol) gelatin, 200 pM dATP; 200 pM dTTB; 200 pM 
dCTP; 200 pM dGTP; and 2.5 units Thermus aquaticus (Taq) DNA poly- 
merase I (U.S. Pat No. 4,889,818) per 100 microliters (pi) of buffer, 

20 

The primer may be selected to be complementary to one or more codons or 
parts of such codons. The length of the primers may be determined by the 
length of the codons, however, the primers usually are at least about 1 1 nu- 
cleotides in length, more preferred at least 15 nucleotides In length to allow 
25 for an efficient extension by the? polymerase. The presence or absence of one 
or more codons is indicated by the presence of or absence of an extension 
product The extension product may be measured by any suitable! method, 
such as size fractioning on an agarose gel and staining with ethidium bro- 
mide. ; i i 

.30 i « 

In a preferred embodiment the admixture of identifier oligonucleotide and 
primer Is termocycled to obtain a sufficient number of copies of thfe extension 



product The thermocycling is typically carried out by repeatedly increasing 
and decreasing the temperature of the mixture within a temperature range 
whose lower limit Is about 30 degrees Celsius (30°C) to about 55? C and 
whose upper limit is about 90°C to about 100° C. The increasing pnd de- 
5 creasing can be continuous, but is preferably phasic with time periods of rela- 
tive temperature stability at each of temperatures favouring polynucleotide 
synthesis, denaturation and hybridization. 

When a single complex is analysed In accordance with the present method, 
10 the result may be used to verily the presence or absence of a specific chemi- 
cal entity during the formation of the display molecule. The formation of an 
extension product is indicative of the presence of an oligonucleotide part 
complementary to the primer in the Identifier oligonucleotide. Conversely, the 
absence of an extension product is indicative of the absence of an ollgonu- 
1 5 cleotide part complementary to, the primer in the identifier oligonucleotide. 
Selecting the sequence of the primer such that it is complementary to one or 
more codons will therefore provide information of the structure of ithe en- 
coded molecule coded for by this codon(s). ! 

i 

20 In a preferred aspect of the invention, in the mixture of the identifier oligonu- 
cleotide and the primer oligonucleotide, a second primer complementary to a 
sequence of the extension product is included. The second primer is also 
termed reverse primer and ensures an exponential increase of the number of 
produced extension products. The method using a forward and reverse 

25 primer Is well known to skilled person in the art and is generally referred to as 
polymerase chain reaction (abbreviated PCR) in the present application with 
claims. In one embodiment of the Invention the reverse primer is annealed to 
a part of the extension product downstream, i.e. near the 3 'end of the exten- 
sion product, or a part complementing the coding part of the Identifier oti- 

30 gonucleotkle. In another embodiment^ the first primer (forward primer) an- 
neals to an upstream position ol the identifier oligonucleotide, preferably be- 

i I 1 
♦ 
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fore the coding part, and the reverse primer anneals to a sequence of the 
extension product complementing one or more codons or parts thereof. 

•i 

The amplicons resulting from die PCR process may be stained during or fol- 
5 lowing the reaction to ease the detection. A staining after the PCR process 
may be prepared with e.g. ethldlum bromide or a similar staining sgent. As 
an example, amplicons from the PCR process is run on an agarose gel and 
subsequently stained with ethidium bromide. Xlnder UV illumination bands of 
amplicons becomes visible. It is possible to incorporate the staining agent in 
1 0 the agarose gel or to allow a solution of the staining agent to migrate through 
the gel. The amplicons may also be stained during the PCR process by an 
intercalating agent, like CYBR. In presence of the Intercalating agent while 
the amplification proceeds it will Incorporate in the double helix. The interca- 
lation agent may then be made visible by irradiation by a suitable source. 

15 

The intensity of the staining is Informative of the relative abundance of a spe- 
cific amplicon. Thus, it is possible to quantify the occurrence of a codon in an 
identifier oligonucleotide. Wherta library of bifunctional complexes has been 
subjected to a selection the codons in the pool of identifier oligonucleotides 

20 which has been selected can be quantified using this method. As! &n example 
a sample of the selected identifier oligonucleotides is subjected to- various 
PCR amplifications with different primers In separate compartments and the 
PCR product of each compartment is analysed by electrophoresis in the 
presence of ethidium bromide. The bands that appear can be quantified by a 

25 densitometry analysis after irradiation by ultraviolet light and the relative 
abundance of the codons can be measured. 

Alternatively, the primers may be labelled with a suitable small molecule, like 
biotin or dtgoxlgenin. A PCR-EL1SA analysis may subsequently be performed 
30 based on the amplicons comprising the small molecule. A preferred method 
includes the application of a solid support covered with streptavidln or avidin 
when btotin is used as label and anti-digoxigenin when digoxigenirt is used as 

* ' "\ 
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the label. Once captured, the amplicons can be detected using an enzyme- 
labelled avidin or anthdixigenin reporter molecule similar to a standard ELISA 
format. 

5 To avoid laborious post-PCR handling steps required to evaluate the ampli- 
cons, H is In a certain embodiment preferred lo measure If le extension proc- 
ess "real time 0 . Several real time PCR processes has been developed and all 

i • I 

the suitable real time PCR process available lib the skilled person in the art 
can be used in the evaluating step of the present invention and ana Include in 
1 0 the present scope of protection! The PCR reactions discussed below are of 
particular interest. 

The monitoring of accumulating amplicons In real time has been made possi- 
ble by labelling of primers, probes, or amplicons with fluorogenic molecules. 

1 5 The real time PCR amplification is usually performed with a speed faster than 
the conventional PCR, mainly due to reduced cycles time and the use of 
sensitive methods for detection of emissions from the fluorogenic labels. The 
most commonly used fluorogenic oilgoprobes rely upon fluorescent reso- 
nance energy transfer (FRET) between fluorogenic labels or between one 

20 flourophor and a dark or "black-hole* non-fluorescent quencher (NFQ), which 
disperse energy as heat rattier than fluorescence. FRET Is a spectroscopic 
process by which energy is passed between molecules separate^ by 10-100 
A that have overlapping emission and absorption spectra. An advantage of 

many real time PCR methods Is that they can be canted out in a closed sys- 

■j 

25 tern, i.e. a system which does not need to be opened to examine the result of 
the PCR. A closed system implies a reduced result turnaround, minimisation 
of the potential for carry-over contamination and the ability to closely scruti- 
nise the essay's performance. 

30 The present real time PCR methods currently available to the skilled person 
can be classified into either amplicon sequence specific or non-specific 
methods. The basis for the non-specific detection methods is a DNA-btnding 



20 

fluorogenic molecule. Included in this class are the earfiest and simplest ap- 
proaches to real time PCR. Ethldlum bromide, YOPRO-1, and SYBR* green 
1 all fluorescence when associated with double stranded DNA which Is ex- 
posed to a suitable wavelength of light This approach requires the fluores- 
5 cent agent to be present during the PCR process and provides for a real time 
detection of the fluorescent agent as it Is incorporated into the double 
Stranded helix. 

' i 

The ampllcons sequence specific methods Includes, but are not limited to, 

10 the TaqMan®. hairpin, LlghtCyclei* Sunrise®, and Scorpion® methods. The 
LlghtCydei® method also designated "HybProbes" make use of a pair of ad- 
jacent fluorogenic hybridisation oligonucleotide probes. A first usually the 
upstream oiigoprobe is labelled with a 3' donor fluorophore and the;second, 
usually the downstream probe is commonly labelled with either a Light cycler 

1 6 Red 640 or Red 705 acceptor fluorophore a the 5' terminus so that when 
both oligoprobes are hybridised the two fluorophores are located in close 
proximity, such as within 10 nm, of each other. The dose proximity provides 
for the emission of a fluorescence when irradiated with a suitable light sour- 
ce, such a blue diode In case of the UghtCycJer^. The region for annealing of 

20 the probes may be any suitable position that does not interfere with the 

primer annealing. In a suitable setup, the site for binding the probes are posi- 
tioned downstream of the codon region on the Identifier oligonucleotide. Al- 
ternatively, when a reverse primer is used, the region for annealing the 
probes may be at the 3' end of the strand complementing the identifier oli- 

25 gonucleotide. Another embodiment of the LightCycler method indoles that 
the pair of oligonucleotide probes are annealed to one or more codpns and 
primer sites exterior to the codihg part of the Identifier oligonucleotide are 
used for PCR amplification. ! 

c 

30 The TaqMan® method, also referred to as the 5* nuclease or hydroiysis 
method, requires an oiigoprobe. which is attached to a reporter flouraphor, 
such as 6-carboxy-fluoroscein. and a quencher fluorophore, such as 6- 
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carboxy-tetramethyl-rhodamine, at each end. When in close proxltalty, i.e. 
annealed to an identifier oligonucleotide, or a sequence complementing the 
Identifier oligonucleotide, the quencher will "hijack" the emissions that have 
resulted from the excitation of the reporter. As the polymerase progresses 

5 along the relevant strand, It displaces and the hydropses the ollgoprobe via 
its endonudease activity. Once the reporter Is iemoved from the extin- 
guishing influence of the quencher, It is able to release excitation energy at a 
wavelength that can be monitored by a suitaBJe Instrument, such as ABI 
Prism® 7700, The fractional cycle number at which the real-time fluorescence 

10 signal mirrors progression of the reaction above the background noise is 
normally used as an indicator of successful identifier oligonucleotide amplifi- 
cation. This threshold cycle (C T ) Is defined as the PCR cycle in which the 
gain in fluorescence generated 'by the accumulating amplicons exceeds 10 
standard deviations of the mean base line fluorescence. The C T Is proper- 

1 5 tional to the number of identifier oligonucleotide copies present in the sample. 
The TaqMan probe is usually designed to hybridise at a position downstream 
of a primer binding site, be it a forward or a reverse primer. When the primer 
is designed to anneal to one or more eodons of the identifier oligonucleotide, 
the presence of these one or more codons is indicated by the emittance of 

20 light Furthermore, the quantity of the identifier oligonucleotides comprising 
the one or more codons may be measured by the C T value. 

The Hairpin method involves an ollgoprobe, in which a fluorophore and a 
quencher are positioned at the termini. The labels are hold in close proximity 

25 by distal stem regions of homologous base pairing deliberately designed to 
create a hairpin structure which result in quenching either by FRET or a di- 
rect energy transfer by a colllsional mechanism due to the intimate proximity 
of the labels. When direct energy transfer by a collision mechanism is used 
the quencher is usually different from the FRET mechanism, and is suitably 

30 ^-(^-dimethylamino-phenylazoj-benzene (DABCYL). In the presence of a 
complementary sequence, usually downstream of a primer, or within the 
bounds of the primer binding sides In case of more than one a single primer. 
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the ollgoprobe will hybridise, shifting into an open configuration, the fluoro- 
phore is now spatially removed ifrom the quencher's influence and! fluores- 
cence emissions are monitored) during each cycle. In a certain aspect, the 
hairpin probe may be designed <to anneal to a codon in order to detect this 
5 codon If present on the identifier oligonucleotide* This embodiment may be 
suitable if codons only differs from each other with a single or a few nucleo- 
tides, because is in well-known that the occurrence of a mismatch between a 
hairpin oligoprobe and its target sequence has a greater destabilising effect 
on the duplex than the introduction of an equivalent mismatch between the 
1 0 target oligonucleotide and a linear oligoprobe* This is probably because the 
hairpin structure provides a highly stable alternate conformation. 

The Sunrise and Scorpion methods are similar in concept to the hairpin oll- 
goprobe, except that the label becomes irreversible incorporated lih to the 

15 PGR product The Sunrise metftod involves a primer (commercially available 
as AmpHfluor™ hairpin primers) comprising a 5' fluorophore and a 1 quencher, 
e.g. DABCYL The labels are separated by complementary stretcftes of se- 
quence that create a stem whefr the sunrise primer Is closed- At the 3' termi- 
nus is a target specific primer sequence. In a preferred embodiment the tar- 

20 get sequence Is a codon , optionally more codons. The sunrise primer's se- 
quence Is intended to be duplicated by the nascent complementary stand 
and, In this way, the stem is destabilised, the two fluorophores are held apart, 
usually between 15 and 25 nucleotides, and the fluorophore is free to emit its 
excitation energy for monitoring. The Scorpion primer resembles the sunrise 

25 primer, but derivate in having a moiety that blocks duplication on the signal- 
ling portion of the scorpion primer. The blocking moiety is typically hexethyl- 
ene glycol. In addition to the difference in structure, the function of the scor- 
pion primers differs slightly in that the 5' region of the oligonucleotide Is de- 
signed to hybridise to a complementary region within the amplicoris. In a cer- 

30 tain embodiment the complementary region is a codon on the identifier oli- 
gonucleotide. The hybridisatiortforces the labels apart disruptingithe hairpin 
and permitting emission in the same way as the hairpin probes. • i 
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Methods for forming a library of complexes ii 
The complexes comprising an Identifier part having two or more codons that 
codes for reactants that have reacted in the formation of the encoded mole- 
5 cule part of the complex may be formed by a variety of processes: Generally, 
the preferred methods can be used for the formation of virtually any kind of 
encode molecule. Suitable examples of processes include prior art methods 
disclosed In WO 93/20242, WO 93/06121. Wb 00/23458, WO 02/074929, 
and WO 02/103008, the content of which being incorporated herein by refer- 

1 0 ence as well as methods of the present applicant not yet public available, 
including the methods disclosed in DK PA 2002 01955 filed 19 December 
2002 p and DK PA 2003 00430 filed 20 March 2003. Any of these methods 
may be used, and the entire content of the patent applications are included 
herein by reference. 

15 i 
Below four preferred embodiments are described. A first embodiment 
disclosed in more detail in WO p2/1 03008 is based on the use ofi& 
polymerase to incorporate unnatural nucleotides as building blocks. Initially, a 
plurality of template oligonucleotides Is provided. Subsequently primers are 

20 annealed to each of the templates and a polymerase is extending; the primer 
using nucleotide derivatives which have appended chemical entitles. 
Subsequent to or simultaneously with the incorporation of the nucleotide 
derivatives, the chemical entities are reacted to form a reaction product The 
encoded molecule may be posfcmodified by deaving some of the linking 

25 moieties to better present the encoded molecule. 

Several possible reaction approaches for the chemical entities are apparent 
First, the nucleotide derivatives can be incorporated and the chemical entitles 
subsequently polymerised. In the event the chemical entities each carry two 
. 30 reactive groups, the chemical entities can be attached to adjacent chemical 
entities by a reaction of these reactive groups. Exemplary of the reactive 
groups are amine and carboxylfc acid, which upon reaction form a'n amide 
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bond. Adjacent chemical entities can also be linked together using a linking 
or bridging moiety. Exemplary of this approach is the linking of two chemical 
entities each bearing an amine group by a bi-carboxylic acid. Yet another 
approach is the use of a reactive group between a chemical entity! and the 
5 nucleotide bunding block, such as an ester or a thioester group. Ah adjacent 
building block having a reactive group such as an amine may cleaVe the 
interspaced reactive group to obtain a linkage to the chemical entity, e.g. by 
an amide linking group. 1 

10 A second embodiment for obtainment of complexes pertains to the use of 
hybridisation of building blocks to an identifier oligonucleotide and reaction of 
chemical entities attached to the building blocks in order to obtain a reaction 
product This approach comprises that templates are contacted with a 
plurality of building blocks, wherein each building block comprises an anti- 

15 codon and a chemical entity. The anti-codons are designed such that they 
recognise a sequence, I.e. a codon, on the template. Subsequent to the 
annealing of the antl-codon and the codon to each other a reaction of the 
chemical entity is effected. < 

20 The template may be associated with a scaffold. Building blocks bringing 
chemical entities in may be added sequentially or simultaneously and a 
reaction of the reactive group of the chemical entity may be effected at any 
time after the annealing of the building blocks to the template. 

25 A third embodiment for the generation of a complex includes chemical or 

enzymaticai ligation of building blocks when these are lined up on a template. 
Initially, templates are provided, each having one or more codons. The 
templates are contacted with building blocks comprising anti-codons linked to 
chemical entitles. The two or more antt-codons annealed on a template are 
. 30 subsequently ligated to each other and a reaction of the chemical entities is 
effected to obtain a reaction product The method is disclosed in more detail 
in DK PA 2003 00430 filed 20 March 2003. 

■ * 
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A fourth embodiment makes use of the extension by a polymerase of an 
affinity sequence of the nascent complex to transfer the anti-codon of a 
building block to the nascent cqmplex. The method implies that a nascent 
5 complex comprising a scaffold and an affinity region is annealed tp a building 
block comprising a region complementary to the affinity section. . , 
Subsequently the anti-codon region of the building block is transferred to the 
nascent complex by a polymerase. The transfer of the chemical entity may 
be transferred prior to. simultaneously with or subsequent to the transfer of 
10 the anti-codon. This method Is disclosed in detail in DK PA 2002 01955 filed 
19 December 2002. 

After or simultaneously with the formation of the reaction product some of the 
linkers to the template may be cleaved, however at least one linker must be 
15 maintained to provide for the complex. 

Nucleotides 

The nucleotides used In the present Invention may be linked together In a 
sequence of nucleotides, i.e. an oligonucleotide. Each nucleotide monomer is 
20 normally composed of two parts, namely a nucleobase moiety, and a back- 
bone. The back bone may in some cases be subdivided into a sugar moiety 
and an internucleoside linker, j ; | 

' 'I 
The nucleobase moiety may be.selected among naturally occurring nucleo- 

25 bases as well as non-naturally occurring nucleobases. Thus, "nucleobase 11 
Includes not only the known purine and pyrimrdine hetero-cydes, but also 
heterocyclic analogues and tautomers thereof. Illustrative examples of nu- 
cleobases are adenine, guanine, thymine, cytosine, uracil,- purine, xanthine, 
diaminopurine, 8-oxo-N 6 -methyladenIne, 7-deazaxanthine, 7-deazaguanIne, 

30 N'.f^-ethanocytosin. N 6 1 N 8 -ethano-2,6-diamIno-purine l 5-methylcytosfne, 5- 
(C^C^-alkynylcytosine, 5-fluorouradl, 5-bromouradI, pseudoisocytoslne, 2- 
hydroxy-5-methyM-triazolopyridIne. isocytosine, isoguanfne, Inoslne and the 
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°non-naturally occurring" nuclebbases described in Benner et al. r U,S. Pat 
No. 5,432,272. The term "nucleobase" is intended to cover these examples 
as well as analogues and tautomers thereof. Especially interesting nucteo- 
bases are adenine, guanine, thymine, cytoslne, 5~methyIcytosine, and uracil, 
which are considered as the naturally occurring nucieobases in relation to 
therapeutic and diagnostic application in humans. 

Examples of suitable specific pairs of nucleofiasee are shown belbw: 
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Synmetfc purtnt bases (utrrtng with natural pyrlml dines 
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7-deaza guanine 



Suitable examples of backbone units are shown below (B denotes a nucleo- 
base): 
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DNA Oxy-LNA TWLMA Amino-INA 




The sugar moiety of the backbone is suitably a pentose but may be the ap- 
propriate part of an PNA or a six-member ring. Suitable examples of possible 
5 pentoses include ribose, 2-deoxyribose, 2'-0-methyt-ribose. 2 , -flour-ribose. 
and 2 f -4'-0-methylene-ribose (LNA). Suitably the nucleobase is attached to 
the V position of the pentose entity. 

An internudeoside linker connects the 3* end of preceding monomer to a 5' 
10 end of a succeeding monomer when the sugar moiety of the backbone is a 
pentose, tike ribose or 2-deoxy ribose. The internudeoside linkage may be 
the natural occurring phospodiester linkage or a derivative thereof. Exam- 
pies of such derivatives Include phosphorothioate, methylphosphbnate, 
phosphoramldate, phosphotriester. and phosphodithioata Furthermore, the 
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internucleoside linker can be any of a number of non-phosphoroiia-containing 
linkers known in the art 

i i 

Preferred nucleic acid monomers include naturally occurring nucleosides 
5 forming part of the DNA as well as the RNA family connected through phos- 
phodlester linkages. The members of the DNA family Include de- 
oxyadenosine, deoxyguanosine, deoxythymldlne, and deoxycytidine. The 
members of the RNA family Include adenosine, guanosine, uridine, cytidine, 
and inosine, tnosine is a non-specific pairing nucleoside and may be used as 
10 universal base because inosine can pair nearly isoenergetJcally with A, T, 
and C. Other compounds having the same ability of non-specifically base- 
pairing with natural nudeobases have been formed. Suitable compounds 
which may be utilized in the present invention includes among others the 
compounds depicted below 
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Examples of Universal Bases: 
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Building frlock 

The chemical entitles that are precursors for structural additions or elimlna- 
5 tlons of the encoded molecule may be attached to a building block prior to 
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the participation in the formation of the reaction product leading the final en- 
coded molecule. Besides the chemical entity, the building block generally 
comprises an anti-codon. 

5 The chemical entity of the building block comprises at (east one reactive 
group capable of participating In a reaction which results in a connection be- 
tween the chemical entity of the building block and another chemical entity or 
a scaffold associated with the nascent complex The connection is facilitated 
by one or more reactive groups of the chemical entity. The number of reac- 

1 0 tive groups which appear on the chemical entity is suitably one to ten. A 
building block featuring only one reactive group is used i.a. in the end posi- 
tions of polymers or scaffolds, whereas building blocks having two reactive 
groups are suitable for the formation of the body part of a polymer or scaf- 
folds capable of being reacted further. One, two or more reactive groups in- 

1 5 tended for the formation of connections, are typically present on scaffolds. 

The reactive group of the building block may be capable of forming a direct 
connection to a reactive group of the nascent complex or the reactive group 
of the building block may be capable of forming a connection to a reactive 
20 group of the nascent complex through a bridging fill-in group. It is to be un- 
derstood that not all the atoms of a reactive group are necessarily* maintained 
in the connection formed. Rather, the reactive groups are to be regarded as 
precursors for the structure of the connection. \-\ 

25 The subsequent cleavage step to release the chemical entity from the build- 
ing block ran be performed In any appropriate way. In an aspect of the Inven- 
tion the cleavage involves usage of a reagent or an enzyme. The cleavage 
results in a transfer of the chemical entity to the nascent encoded molecule 
or in a transfer of the nascent encoded molecule to the chemical entity of the 
. 30 building block. In some cases it may be advantageous to introduce new 
chemical groups as a consequence of linker cleavage. The new chemical 
groups may be used for further reaction in a subsequent cycle, either directly 
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or after having been activated. In other cases it is desirable that no trace of 
the linker remains after the cleavage. 



In another aspect, the connection and the cleavage is conducted as a simul- 
5 taneous reaction, i.e. either the chemical entity of the building block or the 
nascent encoded molecule is a leaving group of the reaction, in general, It Is 
preferred to design the system such that the connection and the cleavage 
occur simultaneously because this will reduce* the number of steps and the 
complexity. The simultaneous connection and cleavage can afso be designed 
1 0 such that either no trace of the linker remains or such that a new chemipai 
group for further reaction is introduced, as described above. 

The attachment of the chemical entity to the building block, optionally via a 
suitable spacer can be at any entity available for attachment, e.g. the chemi- 

15 cal entity can be attached to a nucleobase or the backbone. In general, 1 it is 
preferred to attach the chemical entity at the phosphor of the intemucleosfde 
linkage or at the nucleobase. When the nucleobase is used for attachment of 
the chemical entity, the attachment point is usually at the 7 position of the 
purines or 7-deaza-purfns or at the 5 position of pyrimidfnes. The nucleotide 

20 may be distanced from the reactive group of the chemical entity by a spacer 
moiety. The spacer may be designed such that the conformational space 
sampled by the reactive group is optimized for a reaction with the reactive 
group of the nascent encoded molecule or reactive site. 

25 The anticodon complements the codon of the identifier sequence and gener- 
ally comprises the same number of nucleotides as the codon. The antippdon 
may be adjoined with a fixed sequence, such as a sequence complementing 
a framing sequence. 

: i 

30 Various specific building blocks are envisaged. Building blocks of particular 
interest are shown below. - 
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Specific Building blocks • 

Building blocks transferring a chemical entity to a recipient nucleophllic group 
The building block indicated below is capable of transferring a chemical entity 
(CE) to a recipient nucleophllic group, typically an amine group. The bold 
5 lower horizontal lino Illustrates the building block and the vertical line illus- 
trates a spacer. The 5-membered substituted N-hydroxysucdnimid (NHS) 
ring serves as an activator, i.e. a labile bond te formed between the oxygen 
atom connected to the NHS ring and the chemical entity. The labile bond 
may be cleaved by a nucleophilic group, e.g. positioned on a scaffold 

10 




The 5-mernbered substituted N-hydroxysuccinimid (NHS) ring serves as an 
activator, i.e. a labile bond Is formed between the oxygen atom connected to 

15 the NHS ring and the chemical entity. The labile bond may be cleaved by a 
nucleophilic group, e.g. positioned on a scaffold, to transfer the chemical en- 
tity to the scaffold, thus converting the remainder of the fragment Into a leav- 
ing group of the reaction. When the chemical entity is connected to the acti- 
vator through an carbonyl group and the recipient group is an amine, the 

20 bond formed on the scaffold will an amide bond. The above building block is 
the subject of the Danish patent application No. PA 2002 01946 and the US 
provisional patent application No. 60/434,439, the content of which are incor- 
porated herein in their entirety by reference. 

25 Another building block which may form an amide bond is 
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R may be absent or N0 2 . CF 3 , halogen, preferably CI, Br, or 1, and Z may be 
S or O. This type of building block is disclosed in Danish patent application 
No. PA 2002 0951 and US provisional patent application filed 20 December 
5 2002 with the title W A building block capable of transferring a functional entity 
to a recipient reactive group 11 . The content of both patent application are in- 
corporated herein in their entirety by reference. 

t ! 

i •> 
( 

A nucleophllic group can cleave the linkage between Z and the carbonyl 
1 0 group thereby transferring the chemical entity -(C=0)-C£ > to said nucleo- 
phllic group. i 

Builtitng blocks transferring a chemical entity to a recipient reactive group 
forming a C=C bond 
15 A building block as shown below are able to transfer the chemical entity to a 
recipient aldehylde group thereby forming a double bond between the carbon 
of the aldehyde and the chemical entity 
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CE 



The above building block is comprised by the Danish patent application No. 
DK PA 2002 01952 and the US provisional patent application filed 20 De- 
5 cember 2002 with the title fl A building block capable of transferring a func- 
tional entity to a recipient reactive group forming a C=C double bond". The 
content of both patent applications are incorporated herein in their entirety by 



10 Building blocks transferring a chemical entity to a recipient reactive group 
forming a C-C bond 

The below building block Is able to transfer the chemical entity to a recipient 
group thereby forming a single bond between the receiving moiety, e.g.a 
scaffold, and the chemical entity. 



DK PA 2002 01 947 and the US provisional patent application No 60/434,428. 

The content of both patent applications are incorporated herein in their en- 

« 

20 tirety by reference. 



reference. 
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The above building block is comprised by the Danish patent application No. 



36 

Another building block capable of transferring a chemical entity to a receiving 
reactive group forming a single bond is 



O 



5 




The receiving group may be a nucleophile, such as a group comprising a 
hetero atom, thereby forming a single bond between the chemical entity and 
the hetero atom, or the receiving group may be an electronegative carbon 
10 atom, thereby forming a C-C bond between the chemical entity and the scaf- 
fold. 

The chemical entity attached to any of the above building blocks may be a 
selected from a large arsenal of chemical structures. Examples of chemical 

15 entities are :l 

H or entities selected among the group consisting of a (VC 6 alkyl, C2-C6 al- 
kenyl, Cz-Cs alkynyl. C4-C8 alkadienyl, C3-C7 cydoalkyl, C3-C7 cycloheteroal- 
kyl, aryl. and heteraaryl, said group being substituted with 0-3 R 4 , 0-3 R 5 and 
0-3 R 9 or C^-Cg alkyiene-NR 4 2 . Ci-Ca alkylene-NR 4 C(0)R B , d-Ca al- 

20 kyfene-NR 4 C(0)OR a , C r C 2 alkylene-O-NR 4 ^ C r C 2 alkylene-O-NR 4 C(0)R 8 f 
C1-C2 alkylene-0-NR 4 C(0)OR 8 substituted with 0-3 R*. 

where R 4 is H or selected independently among the group consisting 
of Ci-C 6 alkyl, C2-C6 alkenyl, C2-C6 alkynyl. C3-C7 cydoalkyl, C3-C7 cyclohet- 
eroalkyl, aryl, heteroaryl, said group being substituted with 0-3 R d and 

25 R 5 is selected independently from -N 3l -CNO, -C(NOH)NH 2 , -NHOH, 

-NHNHR 6 , -C(0)R 6 . -SnR B 3( -B<OR 6 )2, -P(0)<OR 6 )2 or the group consisting of 
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Cx-Ce alkenyl. C 2 -C 6 alkynyl. Ca-Cs alkadlenyl said group being substituted 
with 0-2 R 7 , 

where R 6 is selected Independently from H, Ci-Ce alkyl. C3.C7 cycloal- 

kyl, aryl or CrCe alkylene-aryl substituted with 0-5 halogen atoms selected 
5 from -F, -CI, -Br. and -I; and R 7 is Independently selected from -N0 2 , 

-COOR*. -COR e , -CN, -OSiR 6 3 , -OR e and -NR e 2 . 

R a is H, CrCe alkyl, Cz-C 9 alkenyl, C2-C6 alkynyl, C3-C7 eycloalkyj, aryl or 

Ci-Ca alkylene-aryl substituted with 0-3 substituents independently selected 

from -F, -CI, -N0 2 , -R 3 . -OR 3 , -SIR 3 3 
10 R 8 is=0,-F 1 -CI,-Br,-l.-CN.-N02,-OR e ,-NR B 2 ,-NR 6 -C(0)R B , 

-NR^OJOR 8 , -SR 8 , -S(0)R fl . -S(0)2R e , -COOR* -C(0)NR 6 2 and 

-S(0)2NR 6 2. 

Cross-link cleavage building blocks 
15 It may be advantageous to split the transfer of a chemical entity to a recipient 
reactive group into two separate steps, namely a cross-linking step and a 
cleavage step because each step can be optimized. A suitable building block 
for this two step process is illustrated below: 



20 




lnitally, a reactive group appearing on the functional entity precursor (abbre- 
viated FEP) reacts with a recipient reactive group, e.g. a reactive group ap- 
pearing on a scaffold, thereby forming a cross-link. Subsequently, a cleavage 
25 is performed, usually by adding, an aqueous oxidising agent such as l 2 , Br2, 
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CI21 H\ or a Lewis add. The cleavage results in a transfer of the group HZ- 
FEP- to the recipient moiety, such as a scaffold. 

In the above formula 
5 Z Is O f S, NR 4 

Q is N, CR 1 

P Is a valence bond, O, S, NR 4 , or a group Cs^arylene, d-ealkylene, 
d-eO-alkylene, d-sS-alkytene, NR 1 -alkylene!"Ci^alkyIene-O f d*atkylene-S 
option said group being substituted with 0-3 R 4 0-3 R s and 0-3 R 9 or C1-C3 
1 0 alkylene-NR 4 *, C1-C3 aikylene-NR 4 C(0)R B , d-C 3 a!kylene-NR 4 C(0)OR B , 
d-d a!kylene-0-NR 4 2| d-d aikyiene-0-NR 4 C(0)R B , d-d ai- 
kylene-O-NR 4 C(0)OR B substituted with 0-3 R* 
B is a group comprising D-E-F, In which 
D Is a valence bond or a group d-ealkylene, d-ealkenylene, d- 
15 ealkynylene, Cs-7arylene, or Cwheteroarylene, said group optionally being 
substituted with 1 to 4 group R 11 , 

E is, when present, a valence' bond, O. S, NR 4 , or a group Ci- 
ealkylene, Ci^alkenyJene, d^alkynylene, C^arylene, or Cwheteroarylene, 
said group optionally being substituted with 1 to 4 group R 11 . 
20 F is, when present, a valence bond, O, S, or NR 4 , 

A Is a spacing group distancing the chemical structure from the com- 
plementing element, which may be a nucleic acid, 

R 1 . R 2 , and R 3 are independent of each other selected among the 
group consisting of H, d-d alkyl, C;rd alkenyl, d-d alkynyl, d-Ce alkadi- 
25 enyl, d-C? cycloalkyl, d-d cydoheteroalkyl, aryl, and heteroatyl, said 
group being substituted with 0-3 R 4 , 0-3 R 5 and 0-3 R° or d-d al- 
kylene-NR 4 2, d-d a»kylene-NR 4 C(6)R e , d-d alkylene-NR 4 C<0)ORVd-d 
alkylene-ONR 4 2l d-d alkylene-0-NR 4 C(0)R 8 , Ci-d al- 
, ky!ene-0-NR 4 C(0)OR 8 substituted with 0-3 R 9 , 
30 FEP is a group selected among the group consisting of H, d-d aJkyl, 

d-d alkenyl, d-d alkynyl. C 4 -C B alkadienyl, d-d cycloalkyl, d-d cyclo- 
heteroalkyl, aryl, and heteroaryl, said group being substituted with 0-3 R\ 0-3 
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R 6 and 0-3 R*or C^ 3 a!kylene-NR 4 2 , C r C 3 alkylene-NtfC^R* d-C 3 al- 
kylene~NR 4 C(0)OR 8 , Ci-C 2 alkylene-0-NR 4 2 , C1-C2 alkyiene-0-rtR 4 C(0)R 8 
Ci-C 2 alkylene-O-NR*C(O)0R* substituted with 0-3 R 9 , [ 

where R d is H or selected independently among the group consisting 
5 of Ci-C 6 alkyl f C2-C6 alkenyl, C2-C6 alkynyl, C3-C7 cycloalkyl, C3-C7 cyctohet- 
eroalkyi, aryl, heteroaryl, said group being substituted with 0-3 R° and 

R 5 is selected independently from -N 3 , -CNO, -C(IMOH)NH 2 , -NHOH, 
-NHNHR 6 , -C{0)R 6 , -SnR 6 3f -BfOR 6 )*, -P(0)(6R% or the group consisting of 
CrCe alkenyl, Cz-Ce alkynyl, C4-Ca alkadienyl said group being substituted 
10 with 0-2 R 7 r 

where R 6 is selected independently from H, Ci-Ce aikyl, C3-C7 cycloal- 
kyl, aryl or (VC B alkylene-aryl substituted with 0-5 halogen atoms selected 
from -F, •CI, -Br, and -I: and R 7 is Independently selected from -N0 2 , 
-COOR 6 , -COR 0 , -CN, -OSIR 6 *, -OR 6 and -NR* 2 . 
15 R* is H. Ci-C 6 alkyl, C2-C 8 alkenyl, C2-C6 alkynyl. C3-C7 cycloalkyl, aryl or 
Ci-Ce alkylene-aryl substitutediwith 0-3 substituents independently selected 
from -F, -CI, -NO*, -R 3 , -OR 3 ,iSiR 3 3 

R 9 is =0. -F, -CI. -Br, -I, -CN, -^JOa, -OR 6 , -NR 6 2t -NR 8 -C(0)R 8 . 1 
-NR 6 -C(O)0R 8 , -SR 6 , -S(0)R*, -SCOfeR 6 , -COOR 6 , -C(0)NR 6 a and 
20 -S(0>2NR 6 2. 

In a preferred embodiment Z is O or S, P is a valence bond, Q is CN, B is 
CH 2 , and R 1 B R a . and R 3 ls H. The bond between the carbonyl group and Z Is 
deavable with aqueous I2. 

25 

Partitioning 

The partitioning step, by which the library of bifuncfional molecules is sub- 
jected to a condition partitioning one or more complexes having ascertain 
property from the remainder of the library, may be referred to as the enrich- 
30 ment step or the selection step, as appropriate, and includes the^screening of 
the library for encoded molecules having predetermined desirable character- 
istics. Predetermined desirable Characteristics can include binding to a target, 
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catalytically changing the target, chemically reacting with a target In a man- 
ner which alters/modifies the target or the functional activity of the target, and 
covalently attaching to the target as in a suicide inhibitor. : » 

i-i 

5 In theory, molecules of interest can be selected based on their properties 
using either physical or physioi6gical procedures. The method preferred ac- 
cording to the present invention is to enrich molecules with respect to binding 
affinity towards a target of interest In a certain embodiment, the basic steps 
involve mixing the library of complexes with the immobilized target of interest 

10 The target can be attached to a column matrix or microtitre wells with direct 
immobilization or by means of antibody binding or other high-affinity interac- 
tions. In another embodiment, the target and displayed molecules interact 
without immobilisation of the target. Displayed molecules that bind to the tar- 
get will be retained by a filter, size-exclusion chromatography etc, while non- 

1 5 binding displayed molecules will be removed during a single or a series of 
wash steps. The identifiers of complexes bound to the target can then be 
separated by cleaving a physical connection to the encoded molecule or the 
entire complex may be eluted. It may be considered advantageously to per- 
form a chromatography step after or instead of the washing step. After the 

20 cleavage of the physical link between the synthetic molecule and t^ie identi- 
fier, the identifier may be recovered from the media and optionally amplified 
before the decoding step. 



A significant reduction in background binders may be obtained with increased 
25 washing volumes, repeating washing steps, higher detergent concentrations 
and prolonged Incubation during washing. Thus, the more volume and num- 
ber of steps used in the washing procedure together with more stringent con- 
ditions will more efficiently remove non-binders and background binders. The 
right stringency in the washing step can also be used to remove low-affinity 
30 specific binders. However, the washing step will also remove wanted binders 
if too harsh conditions are used. i ? 

. ■ !! 
i i 

• 5 n 
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A blocking step, such as incubation of solid phase with skimmed milk pro- 
teins or other inert proteins and/or mild detergent such as Tween-^20 and Tri- 
ton X-100, may also be used to reduce the background. The washing condi- 
tions should be as stringent aspossible to remove background binding but to 
5 retain specific binders that interact with the immobilized target Generally, 
washing conditions are adjusted to maintain the desired affinity binders, e.g. 
binders in the micromotor, nanbmolar, or pocomolar range. ; \ 

The target can be any compound of interest. E.g. the target can be a protein, 
10 peptide, carbohydrate, polysaccharide, glycoprotein, hormone, receptor, an- 
tigen, antibody, virus, substrate, metabolite, transition state analogue, cofac- 
tor, inhibitor, drug, dye, nutrient, growth factor, cell, tissue, etc. without limita- 
tion. Suitable targets include, but are not limited to, angiotensin converting 
enzyme, renin, cyclooxygenase, 5-Hpoxygenase, ML- 1 0 converting enzyme, 
15 cytokine receptors, PDGF receptor, type (I inosine monophosphate dehydro- 
genase, ^-lactamases, integrin, proteases like factor Vila, kinases like Bcr- 
Abl/Her, phosphotases like PTP-1B, and fungal cytochrome P-450. Targets 
can include.but are not limited to. bradykinln, neutrophil elastasej the HIV 
proteins, including fat, rev, gag, fnt RT. nucleocapsid etc. VEGFj bFGF, 
20 TGFP, KGF, PDGF, GPCR. thrombin, substance P, IgE, sPLA2, rjed blood 
cells, glioblastomas, fibrin clots* PBMCs, hCG« lectins, selectins. cytokines, 
ICP4, complement proteins, etd. * 

A target can also be a surface of a non-biological origin, such as a polymer 
25 surface or a metal surface. The method of the Invention may then be used to 
identify suitable coatings for such surfaces. 

in a preferred embodiment, the desirable synthetic molecule acts on the tar- 
get without any interaction between the nucleic acid attached to the desirable 
30 encoded molecule and the target. In one embodiment, the bound complex- 
target aggregate can be partitioned from unbound complexes by a number of 
methods. The methods Include nitrocellulose filter binding, column chroma- 

: -i 

H 

\ . ' a 
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tography, filtration, affinity chromatography, centrifugation, and other well 
known methods, A preferred method Is size-exclusion chromatography. 

Briefly, the library of complexes Is subjected to the target, which may Include 
5 contact between the library and a column onto which file target isj immobi- 
lised. Identifiers associated with undesirable encoded molecules, ;i.e. syn- 
thetic molecules not bound to the target under the stringency conditions 
used, will pass through the column. Additional undesirable encoded mole- 
cules (e.g. encoded molecules which cross-react with other targets) may be 

1 0 removed by counter-selection methods. Desirable complexes are bound to 
the column. The target may be immobilized In a number of ways. In one em- 
bodiment, the target is immobilized through a cleavable physical link, such as 
one more chemical bonds. The aggregate of the target and the complex may 
then be subjected to a size exclusion chromatography to separate the aggre- 

15 gate from the rest of the compounds in the media. The complex may then be 
eluted from the target by changing the conditions (e.g.. salt, pH, surfactant, 
temperature eta). Alternative^ the complex may be provided with a cleav- 
able linker, preferable orthogonal to a cleavable linker that attaches the target 
to the solid support, at a position between the synthetic molecule and the 

20 identifier. Subsequent to the size exclusion chromatography tills cleavable 
linker is cleaved to separate the Identifiers of complexes having affinity to- 
wards the targets. Just to mention a single type of orthogonal cleavable link- 
ages, one could attach the target to the solid support through a lirtkage that 
can be cleaved by a chemical agent, and the linker separating thd synthetic 

25 molecule and the identifier may be selected as a photocleavable -linkage. 
More specifically, the former linkage may be a disulphlde bond that ran be 
cleaved by a suitable reducing agent like DTT (dithiothreitol) and the latter 
linkage may be a o-nitrophenyi group. 



. 30 There are other partitioning and screening processes which are compatible 
with this invention that are known to one of ordinary skill in the art In one 
embodiment, the products can be fractionated by a number of common 

i 



hi 



Id/09 2003 XUK W.M KA* t *:} ifciv^p/u 

\ 
I 

43 

methods and then each fraction is assayed for activity. The fractionization 
methods can Include size, pH, hydrophoblcity, etc. 



inherent in the present method is the selection of encoded molecules on the 
5 basis of a desired function: this can be extended to the selection of mole- 
cules with a desired function and specificity. Specificity can be required dur- 
ing the selection process by first extracting complexes which are capable of 
Interacting with a non-desired "target" (negative selection, or coutirter- 
selection). followed by positive selection with the desired target. As an exam- 

1 0 pie, inhibitors of fungal cytochrome P-450 (fungicides) are known,to cross- 
react to some extent with mammalian cytochrome P-450 (resulting In serious 
side effects). Highly specific inhibitors of the fungal cytochrome cpuld be se- 
lected from a library by first removing those complexes capable of interacting 
with the mammalian cytochrome, followed by retention of the remaining 

15 products which are capable of interacting with the fungal cytochrome. 

Brief Description of the Figures 

Fig. 1 discloses two embodiments of using a Taqman probe (5' nuclease 
probe) in the measurement of the presence or absence of a certain codon. 
20 Fig. 2 discloses a standard curve used in example 1 . 

Fig. 3 discloses the result of two experiments reported in example 1 . 

Example 1 

A preferred embodiment of the invention utilizing a universal Taqman probe 
25 is shown in Fig.1 . Four codon are shown (P1 through P4; bold pattern) along 
with flanking regions (light pattern). A universal Taqman probe anneals to a 
region adjacent to the codon region, but within the amplicon defined by the 
universal PCR primers Pr.1 and Pr. 2. These primers could be th^ same as 
used for amplification of the identifier oligonucleotides encoding binders after 
. 30 an enrichment process on a specific target. However, are minimal length 
templates preferred during the encoding process, the region involved in 
Taqman probe annealing could be appended to the library identifier oligonu- 



cleotides by e.g. overlap PCR. ligation, or by employing a long downstream 
PGR primer containing the necessary sequences. The added length corre- 
sponding to the region necessary for annealing of the Taqman probe would 
be form 20 to 40 nts depending on the type of TaqMan probe and Ta of the 
5 PCR primers. The Q-PCR reactions are preferably performed in a 96- or 384- 
weli format on a real-time PGR thermocyciing machine. 

Fig. 1 A shows the detection of abundance of "a specific codon sequence in 
position one. Similar primers are prepared for all codon sequences. For each 
10 codon sequence utilized to encode a specific BB in the library a Q-PCR reac- 
tion is performed with a primer oligonucleotide complementary to Jhe codon 
sequence in question. A downstream universal reverse primer Pr..2 is pro- 
vided after the Taqman probe to provide for an exponential amplification of 
the PCR amplioon. The setup is most suited for cases where the podon con- 

1 5 stitutes a length corresponding to a length suitable for a PCR printer. 

i 

» " i 

Fig. 1B shows the detection of abundance of a specific codon sequence in a 
specific codon position using a primer which Is complementing a codon and a 
framing sequence. Similar primers are used for all the codons and framing 

20 sequences. For each codon sequence utilized to encode a specific BB at a 
specific codon position in the library a Q-PCR reaction is performed with an 
oligo complementary to the codon sequence in question as well as a short 
region up- or downstream of the codon region which ensures extension of the 
primer in a PCR reaction only when annealed to the codon sequence in that 

25 specific codon position. The number of specific primers and Q-PCR reactions 
needed to cover all codon sequences in ail possible codon posltiohs equals 
the number of codon sequences times the number of codon positions. Thus, 
monitoring the abundance of 96j different codon sequences in 4 different po- 
sitions can be performed in a single run on four 96 wells micro titre plates (as 

30 shown In Fig. 1 B) or a single 384 well plate on a suitable instrument. This 
architecture allows for the deooding of a 8,5 *10 7 library of different encoded 
molecules. 
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Quantification Is performed relative to the amount of full-length PGR product 
obtained In a parallel control reaction on the same input material performed 
with the two external PGR primers Pr.1 + Pr. 2. Theoretically, a similar rate of 
5 accumulation of this control amplicon compared to the accumulation of a 
product utilizing a single codon + sequence specific primer would Indicate a 
100% dominance of this particular sequence in the position In question. 

Although the setups shown In Fig. 1A and 1B employ a Taqman probe strat- 
1 0 egy, other detection systems (SYBR green. Molecular Beacons etc.) could be 
utilized. In theory, multiplex reactions employing up to 4 different fluorofors in 
the same reaction could increase throughput correspondingly. 

An example of how a deconvolution process of a library of encoded mole- 

15 cules occurs Is described in the following. Imagine that at the end jof a selec- 
tion scheme a pool of 3 ligand families (and the corresponding coding tem- 
plates) are dominating the population and present at approx. the Same con* 
centration. Three different chemical entities are present in the first position of 
the encoded compounds, and each of these chemical entities are present in 

20 combination with one unique chemical entity out of 3 different chemical enti- 
ties in position P2. Only one chemical entity In position 3 gives rise to active 
binders, whereas any of a 20% subset of chemical entities (e.g. determined 
by charge, size or other characteristic) are present In position 4. The out- 
come of the initial codon profile analysis would be: 3 codon sequences are 

25 equally dominating in position P1, 3 other codon sequences in position P2, 1 
unique codon sequence is dominant in P3 whereas somewhat similarly In- 
creased levels of 20% of the codon sequences (background levels of the re- 
maining 80% sequences) are seen in P4. In such cases it could be relevant 
to use an iterative Q-PCR (*IQ-PCR n ) strategy to perform a furthef deconvo- 

30 lution of a library after selection^ Again with reference to the example above, 
by taking the PCR products from the 3 individual wells that contained primers 
giving the high yields in position? P1, diluting the product appropriately and 

i 
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performing a second round of Q-PCR on each of these identifier oligonucleo- 
tides separately, it would be possible to deduce which codon sequence(s) is 
preferred in P2 when a given codon sequence is present in P1 . 

i !;i 

t - M 
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Synthesis of Identifier oligonucleotides: 

The 10 identifier oligonucleotides were assembled in 10 seperate 50 pi PCR 
reactions each containing 0.05 pmol of the oligos Q-Temp1-X, Q-Temp2, Q- 
Temp3-X and Q-Temp4 (x=1 through 10) and 25 pmol of the external prim- 
5 ere FPv2 and RPv2 with TA- 53°C. The 160 bp products were gel-purified 
using QiAqulck Gel Extraction Kit from QIAGEN (Cat. No. 28706) and quanti- 
fied on spectrophotometer. As a control, 20 ng of each of the templates (as 
estimated from these measurements) were loaded on an agarose gel. 

10 Preparation of samples for Q-PCR: 

Sample A: Generated by mixing 20 ng from each identifier oligonucleotide 
prep. Volume was adjusted to 50 pi. Concentration: 4 ng/pl = 38.46 fmol/pl 
(160bp x 650 Da/bp =1 .04x105 g/mol. 1 ng= 9.61 5 fmol). Diluted to 10 7 cop- 
ies/5ul (0.00332 fmol/ul). 

15 

Sample B: 20 ng/20pl stocks of each template were prepared. The sample 

was mixed as follows: 

5pl undil. Template #10 

5pl 2x dil. Template #9 
20 5pl 4x dil. Template #8 

5pl 6x dil. Template #7 

5pl 16x dil. Template #6 

5pl 32x dil. Template #5 

5pl 64x dil. Template #4 
25 5pl 128x dil. Template #3 

5pl 256x dil. Template #2 

5pl 512x dil. Template #1 ; $ 
Concentration: 1 0ng/50pl= 0.20 ng/pl = 1 .923 fmol/pl. Diluted 579.2-fold to 
10 7 copies/5pl (0.00332 fmol/pl). 

.30 

Standard curve: The samples for the standard curve was prepared by diluting 
Sample A 1 16.55-fold to 10° copies/5 pi (0.33 fmol//pl) and subsequently per- 
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forming a 10-fold serial dilution of this sample. 5 Ml was used for each PCR 
reaction. The standard curve is-shown in Fig. 2. 

Q-PCR reactions 

For 5 ml premix (for one 96-well plate): 

2.5 ml Taqman Universal PCR Master Mix (Applied Biosystems; Includes Taq 
polymerase, dNTPs and optimized Taq pol. buffer) 
450 pi RPv2 (10 pmol/ul) K ' 

25 pi Taqman probe (6-FAM-TCCAGCTTCTAGGAAGAC-MGBNFQ; 50 pM; 
Applied Biosystems) 
1075 pi H2Q 



40.5 pi premix was allquoted into each well and 4.5 pi of relevant upstream 
PCR primer (FPv2 (for standard curve) or one of the codon specific primers 
listed below; 10 pmol/pl) and 5 pi sample (H20 in wells for negative controls) 
was added. The codon-specific'PCR primers were: (Tm calculations shown 
are from Vector NTI; matched to Tm for RPv2 (67.7°C)) 

P1-1: GTCATACTAGCTGCTAGAGATGTGGTGATA 66.8°C 

P1-2: CATACGGAAGAAGACAGAAGACCTGATA 67.8'C 

P1-3: TCATACTCAGGAGTCGAGAACTGAAGATA 67.6°C 

P1-4: CATACTGTGTACGTCAACACGTCAGATA 67.4°C 

P1-5: CATACTGTGGAACTACCATCCAAGGATA 68.0°C 

P1-6: CCATCCAACATCGTTGGAAGAT 67.8°C 

P1-7: CATACAACCTGTCCTGTGAGATCTGATA 67.7'C 

P1-8: ATACTCACGAAGCTGGATGATGAGATA 67.3°C 

P1-9: CATACTAGCATCGATCGAACGTAGGATA 68.1°C 

P1-10: TCATACTCGAAGCTACTGTCGAGATGATA 68.2°C « 

P2-1: ATATTAGTGTGTGACGATGGTACGCA 67.8°C I 

P3-1: ACAAGTACGAACGTGGATCAGAGA 67.7°C i- 

P4-1: CGAGCAGGACCTGGAACCT 67.7»C $ 

P4-2: TCGACCACTGCAGGTGGA 68.3°C 



25 
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P4-3: GCTTCCTCTGCTGCACCA 66.7°C 

P4-4: GGTGTCGAGGTGAGCAGCA 69.1'C 

P4-5: CGACGAGGTCCATCCTGGT 68.6°C 

P4-6: GTGAGGAGCAGGTCCTCCTGT 68.0°C 

5 P4-7: CTGACACTGGTCGTGGTCGA 68.8°C 

P4-0: CATCTCGACGACCTGCTCCT 67.9°C 

P4-9: ACGAGGTCTCCACTGGTCCA 6B.3°C 
P4-10:ACTGAGCTGCTCCTCCAGGT v 66.5°C 

1 0 Thermocycling/measurement of fluoresence was performed on an Applied 

Biosystems ABl Prism 7900HT real-time Instrument utilizing the standard 

cycling parameters: 

QS'C 10 mln; 

40 cycles of 
15 95°Cl5sec; 

60 e C 1 min 

; i 

All samples were run in duplicate. , 

20 Rgsyi& { 

Fig. 2 shows the standard curve calculated by the 790QHT system software. 
The log of the starting copy number was plotted against the measured Cy 
value. The relationship betweeh C T and starting copy number was linear In 
the range from 10 to 10 9 template copies. 



This standard curve was utilized by the system software to calculate the 
quantity in the "unknown 0 samples as shown below. 
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Table I: Sample A (Shown graphically In Flo. 3A 
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193310o,o0 


2085476,40 


1000000 


P4-4 


1359684,40 


1364621,40 


1000000 


P4-5 


2206709,80 


2065613,60 


1000000 


P4-6 


1652718.10 


1873777,20 


1000000 


P4-7 


1468208,10 


1416153,00 


1000000 


P4-8 


1664467,50 


1581067.00 


1000000 


P4-9 


1462520,60 


1594593,80 


1000000 


P4-10 


2020088.20 


1912277.40 


1000000 



Table II: Sample B (Shawn graphically in Fig. 38) 
Sample Observed 
B:2-fbld Observed A B Expected 
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dii. 








FPv2 


4.97E+06 


5.05E+06 


10000000 


P1-1 


9955,07 


10899,97 


9765,625 


P1-2 


12732 32 


13469.12 


19531,25 


P1-3 


25542,8 


25419,85 


39062,5 


P1-4 


34740,09 


44070,01 


76125 


P1-5 


110881,41 


123734,13 


156250 


P1-6 


163687,44 


166220,5 


312500 


P1-7 


156993,81 


172005,64 


625000 


P1-8 


343176,78 

W^ W^r • f W, W W 


37:4809,13 


1250000 


P1-9 


646619,44 


576151 


2500000 


P1-10 


1 49E+06 

> p ■ Vr ■ w%* 


1.72E+06 


5000000 


P2-1 


5 19E+06 


5 37E+06 


10000000 


P3-1 
■ w i 


5 29E+06 


5 09E+06 

W, W w ^™ WW 


.10000000 


P4-1 


/no s la naH 


70223,8 


9765.625 


P4-2 


42103.32 


22733.17 


19531,25 


P4-3 


54480.62 


39663,62 


39062,5 


OA A 




4QQCQ Q 


76125 


P4-5 


137946,95 


115027,34 


156250 


P4-8 


174134,64 


156442,55 


312500 


P4-7 


316505.78 


283856,84 


625000 


P4-8 


737661,44 


691296,75 


1250000 


P4-9 


1.42E+06 


1.45E+06 


2500000 


P4-10 


3.72E+06 


3.52E+06 


5000000 



The results of the experiments show the possibility of accurately quantifica- 
tion of identifier oligonucleotides down to or even below 10 copies with a 9 
4 5 fold dynamic range, and reliable relative quantification of the tested codons in 
various positions in the identifier oligonucleotide. 
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While the invention has been described with references to specific methods 
and embodiments, it will be appreciated that various modifications and 
changes may be made without departing from the Invention. All patent and 
literature references cited herein are hereby Incorporated by reference in 
5 their entirety. 
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Claims 

1- A method for obtaining structural information about an encoded mole- 
cule produced by a process comprising reaction of a plurality of chemical en- 
tities, said encoded molecule being capable of forming part of a complex also 
5 comprising an identifier oligonucleotide containing codons informative of the 
identity or chemical entitles which have participated In the formation of the 
encoded molecule, the method comprises 

a) mixing a primer oligonucleotide with the identifier oligonu- 
cleotide, 

10 b) subjecting the mixture to a condition allowing for an exten- 

sion reaction to occur when the primer is sufficient complemen- 
tary to a part of the identifier oligonucleotide, and 
c) evaluating, based on measurement of the.extension reaction, 
the presence, absence, or relative abundance of one or more 

15 codons. 

2. The method according to claim 1 , wherein a composition of one, two, 
or more identifier oligonucleotides are processed simultaneously: : 

3. The method according to daim 2, the composition is a result of sub- 
jecting a library of different complexes to a condition partitioning one or more 

20 complexes having a certain property from the remainder of the library. 

4. The method according to claim 1 , wherein the condition which allows 
for an extension reaction to occur includes a polymerase or a llgase as well 
as suitable substrates. 

5. The method according to claim 4, wherein the condition includes a po- 
25 lymerase and a substrate comprising a blend of (deoxy)ribonucleotide 

triphosphates. 

6. The method according to any of the preceding claims, wherein the 
chemical entities are precursors for structural units appearing in the encoded 
molecule. 

30 7. The method according to any of the claims 1 to 6, wherein the process 
of producing the one or more encoded molecules comprises transferring one 
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or more chemical entities to a nascent encoded molecule by a building block 
which further comprises an antl-codon. 

8. The method of claim 7, wherein the information of the anti-eodon is 
transferred in conjunction with the chemical entity to the nascent fehcoded 

5 molecule. 1 

9. The method according to any of the preceding claims 1 to 8, wherein 
the Identifier comprises two or more codons. 

10. The method according to any of the preceding claims 1 to 8, wherein 
the identifier comprises three or more codons. 

10 11. The method according to any of the preceding claims, wherein 

neighbouring codons of the identifier are spaced by a framing sequence. 

1 2. The method according to claim 1 1 . wherein the framing sequence po- 
sitions the reaction of a chemical entity in the synthesis history of the en- 
coded molecule. 

15 13. The method according to any of the claims 1 to 12. wherein at least a 

part of the primer is complementary to a codon. 1 1 

14. The method according to claims 1 to 13, wherein at least a part of the 
primer is complementary to a cqdon and an adjacent framing sequnece. 

15. The method according to any of the claims 1 to 13, wherein! the 
20 codons have a length of four or more nucleotides. 

16. The method according to any of the claims 1 to 15, wherein the se- 
quence comprising the codon and an adjacent framing sequence has a total 
length of 1 1 nucleotides or more. 

17. The method according to any of the claims 1 to 16, wherein the exten- 
25 sion reaction Is measured using the polymerase chain reaction (PCR), 

wherein the primer of claim 1 is involved in said PCR. 

18. The method according to any of the claims 1 to 17, wherein a primer is 
labelled. 

1 9. The method according to claim 1 8, wherein the primer Is labelled with 
30 a small molecule, a radio active component, or a fiuorogenic molecule. 

20. The method according to.claim 19. wherein the small molecule label is 
selected from biotin, dinitrophenpl, and digoxlgenln, and the PCR cjmplicons 
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are detected using an enzyme labelled streptavidin, anti-dinitrophenoL or 
anti-digoxigenln, respectively, reporter molecule. 

21. The method according to any of* the claims 1 to 19, whereto extension 
reaction is measured by real-time PGR. • j 

5 22. The method according to dalm 21 . wherein the real-time PGR in- 

volves the use of an oligonucleotide probe responsible for the generation or a 
detectable signal during the propagation of the PGR reaction. 

23. The method according to any of the claims 1 to 21 ..wherein the probe 
is designed to hybridise at a position downstream of a primer binding site. 
10 24. The method according to claim 22 or 23 , wherein the probe is a 5' nu- 

clease oligoprobe or a hairpin oligoprobe. 

25. The method according to claims 2 or 3, wherein the library comprises 
complexes with Identifier oligonucleotides having n codon positions and the 
codons in said codon positions being selected from a set of m different 

15 codons. 

26. The method according to claim 25, wherein a framing sequence is re- 
lated to each of the n codon positions In a particular complex, said framing 
sequence positions the reaction of a chemical entity in the synthesis history . 
of the encoded molecule* j ■ t 

20 27. The method according to claim 25, wherein each codon in the set of m 

different codons differs from any other codons in the set in at least two 
nucleotide positions. 

28. The method according to claim 26, wherein each framing sequence in 
a set of n different framing sequences differs from any other framing se- 

25 quences in the set In at least two nucleotide positions. 

29. A method for identifying the chemical entities utilized in the formation 
of an encoded molecule or a composition of encoded molecules, wherein In 
separate compartments, n x m primers Individually are mixed with an aliquot 
of a composition obtained by subjecting a library of different complexes to a 

30 condition partitioning said composition from the remainder of the library, sub- 
jected to a mixture of polymerase and substrate (deoxy)ribonudeotide 
triphosphates under conditions allowing for an extension reaction to occur 

■ : i 
i .1 
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when a primer is sufficient complementary to a part of one or more identifier 
oligonucleotides present in the aliquot, and evaluation, based on measure- 
ment of the extension reaction, the presence, absence, or relative abundance 
of one or more codons in each compartment. 

5 30, A set comprising a collection of oligonucleotide primers, a 1 polymerase, 
a composition of (deoxy)ribunucleotlde' triphosphates (dNTPs), ajid a library 
of complexes composed of a display molecule part and an identifier oligonu- 
cleotide, said oligonucleotide comprising codfcns informative of the identity of 
the chemical entities which ha£ participated in the formation of the display 

1 0 molecule, wherein the oligonucleotide primers are sufficient complementary 
to codons appearing on the identifier oligo nucleotides in the library to allow 
for an extension reaction to occur. 

31. An encode molecule identified by a method according to claims 1 to 
28. 

15 



■I 
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Abstract 

The present invention relates to a method for obtaining structural information 
about an encoded molecule. The encoded molecule may be produced by a 
reaction of a plurality of chemical entitles and may be capable of being con- 
nected to an identifier oligonucleotide containing codons informative of the 
identity or the chemical entities which have participated in the formation of 
the encoded molecule. In a certain embodiment, primers are designed com- 
plementary to the codons appearing on the Identifier oligonucleotide, and the 
presence, absence or relative abundance of a codon is evaluated by mixing a 
primer with the Identifier oligonucleotide in the presence of a polymerase and 
substrate (deoxy)ribonucleotide triphosphates and measuring the extension 
reaction. 
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Sample A: Equimolar ratios 



1 .nnF+ns 

1.00E+07 f 
1.00E+06 
1.00E+05 
1.00E+04 
1.00E+03 
1.00E+02 
1.00E+01 
1.00E+00 



* A • •ft- 0 ■ P ' t ft 



8 * 1 8 I 1 i ? 1 - 



-t— f I — I — I — I — I — I — hH — I — i — I — I — I — I — ' — I — I 



♦ Observed 1 
» Observed 2 
a Expected 



Ffg 3B Sample B 



10OBrO5 
1JOOH-K34 

1JOOE403 
IJQQfrKE 
1.00E-KJ1 



1JB0E-H30 



* « » 



* 9 



VTT 
a 



i i • 



-1 1 1 1 I I I 



-I — I I I 



